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Abstract 

The problem considered is the computation of an infinite product (com- 
position) of Lie transformations generated by homogeneous polynomials of 
increasing order from a given convergent power series. Bounds are computed 
for the infinitesimal form of Lie transformations. The results obtained do not 
guarantee convergence of the product. Instead, the optimum truncation is 
determined by minimizing the terms of order n + 1 that remain after the first 
n Lie transformations have been applied. 
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I. INTRODUCTION 



A method based on an infinite product (composition) of Lie transformations (exponenti- 
ated vector fields) generated by homogeneous polynomials of increasing order was developed 
a long time ago in order to efficiently perform perturbative calculations in Hamiltonian sys- 
tems when the small parameters are the dynamical variables themselves PJ. In particular, 
it was shown in Ref. M that the product can be computed from, or be used to compute, 
a power series in the dynamical variables. The relation between Lie transformations and 
power series, however, was established only at a formal level, that is order by order. 

More recent work has provided firm bounds on the results that can be obtained using 
the method. In Refs. [@,[HJ, for example, a variant of the method is applied to the problem 
of bringing a Hamiltonian function or a Hamiltonian vector field to normal form. In Ref. || , 
and in Ref. |4[] for Hamiltonian systems, sufficient conditions are given on the coefficients 
of the polynomials and on the domain of the dynamical variables such that the infinite 
product of Lie transformations is convergent. In this paper we turn to the construction 
of Lie transformations from a power series. Assuming that the power series has a finite 
domain of absolute convergence, we obtain bounds for the norms of the vector fields that 
are computed from such a series and develop a procedure for determining the optimum 
truncation of the product of Lie transformations. (Note that the analogous problem with 
power series for Refs. f|| and |§ is the one in which only the first nonlinear term in the series 
is nonvanishing.) As in Ref. f|, we do not require that Lie transformations be symplectic 
(i.e. that they arise from Hamiltonian systems); rather, the vector fields are taken to be 
arbitrary homogeneous polynomials. 

In Section |I| we introduce notation, which for quantities that appear in both is the same 
as in Ref. ||, and write down an expression for the coefficients of the polynomials in terms of 
the coefficients of the power series. Section [HI| contains two lemmas which allow us to pass 
from the expression for the coefficients to an inequality in the form of a recursion relation 
for the norms of the vector fields. In Section [IV] we then use the recursion relation to obtain 
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a bound on the norms, which is the main result of the paper. We also provide there an 
asymptotic expression for the bound valid in the limit n — > oo, where n is the order of the 
polynomial. The question of optimum truncation of the product is considered in Section [V], 
and a summary of the results is given in Section [VT|. 

II. PRELIMINARIES 

We work with the transformation M. formally defined by 

Mz = e L2(z) e L3{z) ...e Ln(z) ...z, zeC d . (2.1) 
Here L n is a vector field 

L n {z)-j:gf\z)^ (2.2) 
j=i uz i 

and gf^ a homogeneous polynomial in z of order n, 

gf\z) = E (2.3) 

\r\=n 

The subscript r stands for the collection of indices r 1} ...,r d , |r| = f r x + ... + r d , and 
z r = f z r i . . . z r / . The exponential of L n is given by the usual infinite series 



e 



L„(z) 



1 



Y,-A L n(z)Y, (2-4) 



s=0 S[ 



where s = corresponds to the identity transformation. In the definition of A4 the linear 
transformation has been set equal to the identity, as its computation is not germane to the 
problem at hand. The designation of Eq. (|2.1| ) as formal, on the other hand, reflects the 
fact that we have not specified a domain in C d , if such one exists, on which the infinite 
series of Eq. ( |2.4| ) are convergent for n — 2, 3, — We also define M. n as the product of Lie 
transformations of the form ( |2.1| ) truncated at order n. 

The properties of Lie transformations and their use in perturbation calculations are not 
discussed further in this paper. The interested reader is instead directed to Refs. [[]] for a 
sampling of the surveys of the subject. 



Suppose we are given the power series 



i=2 \r\=i 

which has a nonvanishing domain of absolute convergence, denoted here by D. (T> evidently 
includes the origin.) We are going to examine the construction of vector fields L n chosen in 
such a way that M.z and P(z) agree order by order in z. For the moment we do not specify 
the domain over which the agreement occurs. A lower bound on this domain as a function 
of n is given in Section [V|. In parallel with setting the linear transformation in Ai equal to 
the identity, we have assumed that to first order in z P(z) = z. 

For a vector v, regardless of the vector space, we define the norm \\v || by 

\\v\\ = max |^|, (2.6) 

where | • | stands for the modulus. For brevity we denote the norm of z by x, x = \\z\\. We 
also define the quantity otf^ by 

af= E \^\, (2.7) 

\r\=n 

and a n by a n = \\a^\\. The following relation holds: 

\\L n \\ = max\gj n \z)\ = max | E a^^l — o n x n (2.8) 



\r\=n 



In the subsequent sections we will obtain a bound for a n which will thus enable us to place 
a bound on ||L n ||. 

The first step is to write down an expression for the coefficients in terms of the 
coefficients b^ 1 . We expand the Lie transformations into power series and match terms of 
the same order in z to get 

E ^TT^=£^- (2-9) 

[n]=n-l i2 in ' \r\=n 

The symbol J2[ p }= q is defined as a sum over s 2 , . . . , s p with a condition, 
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E = E • (2-io) 

[p]=q s 2 ,...,s p >0 

s 2+ 2s 3 + -" + (P _1 ) s P=<? 

Note that the operators and Ly* i j do not commute, so the ordering is important. 

In Eq. Q2.9| ) s n can take on only the values of and 1. Together with the fact that 
L n z k = 9k ( z )i this allows us to transform Eq. (|2.9| ) into a recursion relation for 

ag? = &$, (2.11a) 

rS2 t s n-i 

a%! = b$-% E , ^ n>3, neN, (2.11b) 

[n— l\=n— 1 

where == ^ "^1 ■ (Throughout the paper N is taken to include 0.) This is the starting 
point for the computation of estimates for a n . 



III. A RECURSION RELATION FOR NORMS 



With the definition 



/3„ = max( £ (3.1) 

|r|=n 



Eq. (FTTHj ) yields 



E^r 7" «2 T S n — 1 

2 \ -« -^2 • • • 
< , ^ "H r S iS2 -... Sn _ l! ^ 

[n— l]=n— 1 z " 1 



r =n 



< /3 n + max I E 



a: l% 2 ... Lf"- 1 



J n-1 



n [n— l]=ra— 1 z " 1 



< /?„ + 2 -r^ r m P E I ~\ L 2 ■ ■ ■ L s n-lz k (3.2) 

[n-l]=n-l ^ • • • s„-i! fc |r|= Jr! 
Note that the component of the vector on the right side is determined by the component of 
z and is labeled here by k. Our goal is to express the right side of the last inequality in ( |3.2|) 
in terms of a%, . . . , a n -\- We accomplish this through two lemmas (the second one will also 
be used in Section [VP . 

Consider a vector function F whose components are homogeneous polynomials, 



and for which 



E 1 < k < d ' 



\i\=i 



(3.3) 



^(J2\f$\)<<f>i 
\i\=i 

for some G K + . Define the quantities c, m, and B by 



(3.4) 



E 



c tk (F,s 2 ,... J s n )z t = L s 2 K..L s n -Fl n \z 



\t\=m((f>ix l ,S2,---,s n ) 



(3.5a) 



£(<^, s 2 , . . . , Sn ) x m(^,s,,.. >Sn ) = ( X i a2 ±y* . . . (x-a^Y^x 1 . (3.5b) 

dx dx 



Here use is made of the obvious fact that the power of x in (|3.5b|) is the same as the power of 



in Q3.5a|) . Evidently B((f>ix l , s 2 , ■ ■ ■ , s n ) is a nonnegative real quantity and m((j)ix l , s 2 



S n ) 



is a nonnegative integer. The arguments of c, m, and B have been chosen to be rather 
explicit, so that the notation is sufficiently general for the manipulations that follow. When 
referring to a power of only one vector field, on the other hand, we drop the subscript on 
the summation index, denoting it by s, and replace the argument <pix l of m and B simply 
by I. The following holds: 

Lemma 3.1. For all s G N and I G N 



max 

k 



E \ct k {F,s)\)<B{l,s) 

\t\=m(l,s) 



(3.6) 



and 



\L' n F®(z)\\<B(l, 8 )x 



m(l,s) 



(3.7) 



This lemma is given in Refs. though its proof is only outlined there. In the Appendix we 
provide a more complete proof (which is an extension of the relations derived in Appendix 
A of Ref. [§). 

The product of operators of the form Lf can now be bounded by the lemma below. 
Lemma 3.2. For all n > 2, n G N and all functions F of the form (|3.3|) , I G N, 



max \c tk (F,s 2 , . . . ,s n )\ < B((fi l x l ,s 2 , . . . ,s n ) (3.8) 

\t\=m(<j>ix l ,s 2 ,—,s n ) 

and 

\\L? . . . L s »FV(z) II < B&x 1 , s 2 ,..., Sn ) x ^,B2,.,s n )^ (3 9) 



Proof is by induction on n and is straightforward. For n = 2 inequalities (3lj|,3.9) are 



the same as inequalities ( |3~6| , |3.7|) and thus hold by Lemma 3.1. Assume now ( |3.8| , |3.9|) hold 
for a fixed n and all functions F of the form (|3.3|) . Then 

\\L?...L?L2KFW(z)\\ = \\Lf?...L%F«'\z)\\ (3.10) 

where 

FP(z)= £ Qfc (F, Sn+i y (3.11) 
\t\=v 

with V = ns n+ i + I and, by Lemma 3.1, 

max \ctk{F, s n+1 )\ < B^x 1 , s n+1 ). (3.12) 
1*1 — «' 

Use of the induction assumption yields 

max V \c tk (F,s 2 ,...,s n )\<B(B((j)ix l ,s ri+1 )x l ',S2,...,s n ) (3.13) 

\t\=m(B(4nx l ,s n+1 )x l ,s 2 ,...,s„) 

and 

||L* 2 . . . L?F<- l '\z) || < B(B(<p lX \ s n+1 )x l ',s 2 , . . . , ^(^Miy*^). (3.14) 
By unfolding the definitions of m and B we get 

BiBifax 1 , S n+1 )x l \ 8 2 ,..., Sn ) x m(B(^,s n+l)x '',s 2 ,...,s n ) 
= {x2a2 l^ )S2 ' ' ' i^n^B^iX 1 , S n+1 )x 1 ' 

= Bi^X 1 , S2,..., S n , Sn+l)x m{^\s2,...,s M )^ (3 _ 15) 
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and so replace B(B(4>ix l , s n+ i)x 1 ' , s 2 , ■ ■ ■ , s n ) by B(<f>ix l , s 2 , . . . , s n , s n+ i) and 
m(B((j)ix l 1 s n+ i)x l , s 2 , ...,s n ) by m((j)ix l , s 2 , . . . , s n , s n+1 ) in inequalities (|3.13|,|3.14|) . With 
the further replacement of c t k(F, s 2 , ■ ■ ■ , s n ) by c t k(F, s 2 , . . . , s„, s n +i), the proof is complete. 

□ 

We can now make progress with inequality (|3.2| ). Since 

id: 



max 



J2 \-jL s 2 2 ■ ■ -L s n -i z k =max^ \c rk (z, s 2 , . . . , s„_i)|, (3.16) 



k 

\r\=n 



k ,V 1 r! 

\r\=n 

use of inequality (|3.8| ) yields 

a n <Pn+ — — jB(x,s 2 ,...,s n -. 1 ). (3.17) 

[ n _l] =n _l S 2 S»i-1- 

Note that the apparent dependence of the right side of (|3.2| ) on z (or \\z\\) has disappeared, 
as it should. 

The final step is to obtain an explicit expression for B(x, s 2 , . . . , s n _i), which requires 
the evaluation of the right side of Eq. ( |3.5b| ) for / = 1, (f>i = 1. First we note that 



j ( n l)s-p ( s(n-l)+p 

(x n 4-Yx p = -, V ; x p+8(n - 1) , (3.18) 

V dx ) r(^) 

where we take, as it is sufficient for our purposes, n, s, and p to be integers, with n > 2, s > 
0, and p > 1. The relation ( |3.18[) is easily proven by induction on s. Repeated use of (p. 18 
on the right side of Eq. ( |3.5b|) then leads to 



B(x, s 2 ,..., sn-i) = a?(2a 3 ) 3 • • • {{n - 2)a n _ 1 )'"- 1 

p / l+s n _i(n-2) A p / l+Sn-l("-2)+s n _2(n-3) \ p / l+s n _i(n-2)-| hs 

x V "~ 2 / V n ' 3 / V 1 y_ (3.i9) 

p/^^_\ y( l+Sn-l(n-2) \ p/ l+s„^i(n-2)+-+2g 3 X 



n—2 / V n— 3 y \ 1 

which is the desired, though admittedly cumbersome, expression for B. 
With the definitions r\ n = na n+ i, r n = n(3 n+ \, 



Q m = 1; m = n - 1 

n— m— 1 

Q m = 1 + ^ (n - j)s n - j+1 ; 1 < m < n - 2, (3.20a) 
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and 



Q(s 2 , ...,s n ) 



m=l *m+l! J- I m J 

inequality ( |3.17 ) and Eq. ( 2.11a ) become 



(3.20b) 



Vi = n 



(3.21a) 



Vn < r n + n £ VT ■■■Vn-iG(s 2 ,...,s n ); n>2, n£N. (3.21b) 

[n]=n 

In the next section we will use these relations to get a bound for rj n . We call attention 
to the interesting fact that the relations ( 3 .2 If) , and hence the results that follow, do not 
depend explicitly on d. The dimensionality of the space enters only through the definition 
of quantities r\ n and r n . 



IV. BOUND FOR rj N 

Inequality (|3.21b| ) is a complicated relation between rfs and r's. The reader is invited 
to show that attempts to establish simple estimates for rj n , such as rj n < K n or r\ n < K n n\, 
K G 1R + , by induction from ( |3.21b| ) fail. (For the latter case note that the sums over s always 



contain the term s% = s n = 1, Si — 0; 3 < % < n — 1, for which £7(1, 0, . . . , 0, 1) = ^zj-) The 
following gives a bound for 7] n . 

Theorem 4.1. Let K and define the quantities h n by 

n 

hi=l; h n = H(l + 21^(j n>2, nEN. (4.1) 

3=2 

Then 

^ n < ^/i- (4.2) 

PROOF. First, since P(z) of Eq. (|2.5| ) is a convergent power series, \b^)\ is bounded by 
an exponentially growing function of n. As 
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Pn<( n + d max \b%\, (4.3) 

\ 12 J rk 

f3 n is also bounded by an exponentially growing function of n, and so is r n . Thus the quantity 
i_ 

K = max„ Tn for n > 2, n G N, exists and is finite. 

Next, we manipulate the ratios of T functions that appear in Q. For 1 < m < n — 1, 

wP . = (Sm+1 - 1 H s m +i - 2 H ) 

r(^) m m m 

< (s m+1 - 1 + — 

m 

= -^—[m{ s m+i - 1) + Q m ] Sm+1 

< — - — (n + 1 - m) Sm+1 . (4.4) 

- m s m +i v ' v > 

The last inequality makes use of ms m+ i + Q m < n+1, which follows from the condition on 
the sums over s. Hence, 

n S2 (n - 1Y 3 ■ ■ -2 s " , , 

g( *-- s -) ^ 2 .... 3 ;,pJ... („-!)- < 4 - 5 ' 

and inequality fl3.21b|) becomes 

Tn + n E (^r^- iw--^,)- , (4 . 6) 

^ S2 !--. S J? 2 .--(n-l) s " 

To proceed further we establish the following statement. 
Lemma 4.1. For n > 2 and 1 < m < n — 1 

(n-m + l)^h m <2^h n -i- (4-7) 
Proof of Lemma 4.1 is effected in five steps. 

(i) For the case m = n — 1 ( |4.7|) obviously holds. Since m = n — 1 is the only value of m 
when n — 2, it remains to consider n > 3, 1 < m < n — 2. 

For further manipulations it is useful to denote the ratio of the left and right sides of 

(B by G, 

G{n, m) = = ( "" m + 1 ^ r . (4.8) 



2—h n . x 2— n^+i(i + 2— u - w 
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We thus need to show that G(n,m) < 1. 

(ii) Let m = n — 2. For n = 3 the direct calculation shows that G(3, 1) = 0.95, whereas 
for n > 4 we have 



G(n,n- 2) 



3— 



3— 



< 



— < — r 

2^[1 + 2^(n -2)]~ 2~(l + 2^)^ 
3 \ n— 2 / 3 \ — o 1 



2n-3 



2" 



(4.9) 



The last inequality uses > f which evidently holds for n > 4. It remains to consider 
1 < m < n — 3; n > 4. 

(hi) Let m = 1. By direct calculation G(4, 1) = 0.75, whereas for n > 5 we manipulate 
the product appearing in the definition of G to get 



n-l 



n— 1 



^ I n-l 1 

J] (1 + 2 A(j - 1))J = expE - log(l + 2^(j - 1))] > exp[£ - log(2j - 1)] 

j=2 j=2 J j=2 J 

> exp[^— ^ log(2n - 3)] = (2n - 3)^f . 



(4.10) 



The last inequality follows from the monotonic decrease of the summand as a function of j. 
We provide here a brief justification of this argument which, with slight modifications, also 
appears in step (v). Treating j as a continuous variable, we have 



d 



tM2j-i: 



(2j-l)(l-log(2j -!)) + ! 



(4.11) 



dj ./'-'( 2/ - 1) 

For j = 2 the value of the numerator is —0.70, and this value obviously decreases with 
increasing j. 

Returning to the bounding of G, since n > 5 we write 2n — 3 > |n and use inequality 



G(n,l) < 



1 



_, 1 



'T_\n-22_ 

> 5 < n 



(4.12) 



Note that if n is treated as a continuous variable then 



d 



(ir 2 l soar, 7, 1 

log (7 



(in 



49n 



n 



(4.13) 
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and so for n > — Upr = 2.97, the denominator is a monotonically increasing function of n. 

Ms J 

At n = 5 it takes the value 1.10. Therefore, G(n, 1) < 1 for n > 5. It remains to consider 
2 < m < n — 3; n > 5. 

(iv) By direct calculation G(5, 2) = 0.52, and we are left to explore only 2 < m < n — 3; 
n > 6. 

(v) Now we can consider the remaining values of m and n. Proceeding in analogy with 
Eq. (14.101) , we write 

n-1 

t — r ^ 3 , . , 1 . . n — m — 1 n — m — 1 

J] (l + 2~(j-l))J > (2n-3)^^- >n^^. (4.14) 

Therefore, 

G(n,m) < (U - m tl ] : < -L n ^("-^-^). (4.15) 

We now examine the parabola 

V{m) = m 2 — m{n — 1) + n — 1, (4-16) 

where m takes on all real values. Zeroes of Vim) are located at 

m +/ _ = -[n - 1 ± \j{n- l) 2 -4n + 4]. (4.17) 

The quantity under the square root satisfies 

n 2 -6n + 5 = (n - 4) 2 + 2n - 11 > (n - 4) 2 for n > 6, (4.18) 

and so m_ < | and m + > n — |. Therefore for 2 < m < n — 3, V(m) < 0, which gives 
G(n, m) < 1. 

Putting together the results of (i)-(v) completes the proof of Lemma 4.1. □ 
To finish the proof of Theorem 4.1 we carry out an induction on n. For n = 1 we have 

Vi = ri<K, (4.19) 

which verifies ( |4.2|) . Assume now that (|4.2|) holds through n — 1. Then inequality ( [4.6|) gives 

12 



<T n + nK n Y [2^r/i n _ 1 ] S2+2s3+ - +{n - 1)s " 



s 2 \ ■ ■ -s n \l S2 ■■•{n- l) s « 



[n]=n 

Krn + nK^h^ J2 



s 2 \ ■ ■ -s n !l S2 • • • (n - l) s " 



1 



(4.20) 



s 2 \ ■ ■ -s n \l S2 • • • (n- l) s ™ 

where we have used the condition on the sum to sum the power of the summand. The second 
inequality follows from Lemma 4.1. The remaining sums over s nicely sum to 1 — -, as can 
be demonstrated using Cauchy's identity || 

1 



E 



1. 



[n ^ =n s 2 ! ■ • • s n \s n+i \l S2 ■ • • (n - 
Substitution of this result into Eq. ( j4.20|) leads to the inequalities 



(4.21) 



Vn < r n + K n 2^hl_ x {n - 1) < K n 1 + ^h n n _ x {n - 1 



< K r 



/ in _ 1 (l + 2^r(n-l))> 



K n hl 



(4.22) 



For the second inequality we have used K n > r n and for the third one h n ^% > 1. This 
completes the proof of Theorem 4.1. □ 
The large-n behavior of the estimate ( |4.2| ) is not easy to discern. We thus provide an 
asymptotic expression for the result. First we convert the product appearing in ( (4.1[ ) into a 
sum by taking the logarithm of /i™, and then use the Euler-Maclaurin Sum Formula [[J to 
get 



log =$>(*;) 

k=0 
1 



6(m)+ / 0(t)db + ci + XX -1 



gj+i #fl(m) 
(j + 1)! dm 



m — > oo, 



(4.23a) 



where 



9{k) 



1 fc+2 

log (l+2*+r (& + !)), 



(4.23b) 



k + 2 

B n are the Bernoulli numbers, m = n — 2 and Ci is a constant to be determined. The integral 
of 9 evaluates to 
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r m 1 o 11 1 

/ 6{t)dt= [log(m + 2)] 2 + log21og(m + 2) + c 2 + ^-(--log2) + 0(— ), (4.24) 

where c 2 is given by c 2 = —0.577765 . . .. The terms in the sum over j, on the other hand, 
are of order los ^ and can be neglected. Nevertheless, for the numerical computation of the 
constant c\ we have used the j = 1 term in order to improve the numerical convergence of 
the procedure (the j = 2 term is zero since B3 — 0). The resulting asymptotic expansion 
for the bound on rj n reads 



^n(lo g 2 + Ilogn) xexp [Q M^ 

2 n 
The constant C3 is given by C3 = exp(ci + c 2 ) and takes the value C3 = 0.857695 . . .. 



K n h n n ~ (if C3 )"/- w "^ 2 +|i«s«) x exp [OF^)l; n -> 00. (4.25) 
V 2 n 



V. OPTIMUM TRUNCATION 



In Ref. [[J we have shown that sufficient conditions for convergence of Ai n z as n — > 00 
are that r/ n be bounded by an exponentially growing function of n and that z be restricted to 
a suitable domain around the origin. Yet, inequality (|3.21b|) is not consistent with rj n < K n , 



K G M + . Instead of considering convergence, then, we turn to the asymptotic properties of 
A4 n z. In particular, we examine the question of optimum truncation. 

The most natural quantity to optimize is the difference between P(z) and M. n z. Denoting 
this quantity by R(n), we have 



R(n) = \\P(z) -e L2 ■■■e Ln z\ 

CO T S2 T 

Aq).~ ^ ^ ■ ■ ■ L 



=n+l [n)=q-l 2 

B(x, s 2 ,...,s r 



q=n+l\ r \=q q=n+l [n]=q-l S2 ' ' ' ' S 



< E ^ Ar+ E 



q=n+l [n]=q-l S 2- ' ' ' S n . 



= E E V S l 2 ---V S n-Ms2,...,S n )\. (5.1) 

q=n+l [ n ]=q-l 

For the inequality we have used Lemma 3.2 and the condition on the sums over s, and 
for the last equality Eq. ( ^.19[ ). Note that R(n) is not necessarily defined on the entire 



domain T>. This question will be addressed later in this section. We could now substitute 
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the result of Theorem 4.1 in the last expression in Eq. ( |5.1[) to get an explicit estimate for 
R(n). The resulting expression, however, involves an infinite sum (where each term is very 
complicated). We thus use additional inequalities to obtain a closed-form estimate for R(n). 

The first to be simplified is the ratio of T functions occurring in Q. (It is not fruitful to 
use Eq. ( [4.4p here because the condition on the sums over s is different.) For 1 < m < n — 1 
the following holds: 

r(s m+ l + ^gj _ , Qm ^ (n Qmx 

=T77t — : — l^m+l — 1 -\ {S m+ i — Z -\ ) ■ ■ ■ 

r = m m m 

V m I 

'ms m+ i -m + Q m ) (ms m+1 - 2m + Q m ) ■■■Q m 



m Sm + 1 

< — - — (q -m){q- 2m) ■ ■ ■ (q - s m+1 m) 

111 m 1 

1 (g-1)! 



(5.2) 



m s m +i (q-1- s m+ x)\ 

The first inequality relies on the relation ms m+ i + Q m < q. Using ( |5.2|) and (|4.2|) , the sums 
over s in Eq. (|5.1| ) become 



E ^•••e_ 1 ^( S2 ,..., Sn ) 

[n]=g-l 

w ^_ inS2(n _i )S3 ... 2s „ 

" 1 ' w t^ V s 2 ; n« ^ s 3 ; [2(n - 1)]- ^ s n ; [2{n - l)p 



t^o \ s 2 7 V s 3 / [2(n - l)h V s n / [2(n - 1)]« 



S2=0 s 

■ n-1 | -| o-l 



L/=i v J(^-J + 1) 

< (^^i/i^o^-^i + I)("-i)(«-i) 

n 

< {Ke2^h n „ x ) q - X . (5.3) 

The first inequality follows from Lemma 4.1 and the condition on the sum, the second one 
from Eq. (|5.2j ), the third one is evident upon an examination of the ranges of indices S2 
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through s n subject to the condition [n] = q—1, whereas the last one is clear from the relation 
+ < (1 + -) n < e. Finally, as mentioned in the proof of Theorem 4.1, j3 q is bounded 

by an exponentially growing function of q, 

< (§r\ (5.4) 

for some c G M + . Consistent with Eq. (|2.5| ) we have taken f3\ = 1. In terms of c, K 
satisfies K < c. 

From inequality (|5.3|) , and using Eq. (|5.4|) , we then obtain the following closed-form 
estimate for R(n): 

R(n)<- £ (§) E (^^r^) 9 (5.5a) 

c g =n+i 3 a ce2 «-i /l n _i g= n +l 

x n+1 c n x"+ 1 (ce2^i/i w _ 1 )" def 

H — = R (5.5b) 



3ffl-^) l-cxe2^*h n -i 

Note that Eq. ( |5.3| ) or Eq. ( |5.5a| ) explicitly demonstrates the expected result that regardless 
of the dependence of i] n on n, one can always choose x sufficiently small to guarantee that 
M. n z is a convergent transformation (that is that the sums of the form ( |2.4| ) acting on z 
converge). The condition for convergence is 

, (5.6) 

ce2™-i h n -i 

which shows the shrinking of the lower bound on the domain of analyticity of M. n z with 
increasing n. (Evidently, ( |5.6| ) is more restrictive than z G T>. For the latter case it is 
sufficient that x < 3s/c.) 

Since the leading order term in Aiz is z, which is of order x, it is useful to divide R* 
by x, so that the resulting quantity R*/x can be compared to one. Note also that R*/x 
depends on c and x only through the product cx, which we denote by x. 

The question of optimum truncation can now be formulated as follows: given x, find the 
value of n where R*/x reaches its minimum and find the value of the minimum. Eq. (|5.5b ), 



however, is too complicated to carry out the required calculations analytically. Instead, we 
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have to rely on (straightforward) numerical computations. Figure 1 shows the value of n 
where R*/x reaches its minimum as a function of — log 10 (x). This value of n is denoted by 
n m ; n . Figure 2 gives the base-10 logarithm of the value of the minimum, again vs. — log 10 (x). 

If we leave rigor aside, we can obtain analytical expressions that closely approximate the 
solid curves in Figures 1 and 2. First we assume that the minimum of R*/x is determined 

primarily by terms of lowest order in i, x n and then neglect the term (x3~^) n compared 

i i n 

with [xeT^hn-xf 1 . The latter step is justified when (e2 n - 1 h n -i) n ^> 3~3 . We are thus led 

to examine the quantity 

r*(n) def 



x 



{xe2^h n - x ) n . (5.7) 



The location and value of the minimum of r*/x have been determined numerically and 
found to be in excellent agreement with the location and value of the minimum of R*/x, the 
agreement improving with decreasing x. To obtain an analytical expression for the minimum 
of r*/x, however, additional approximations are needed. 

We take the logarithm of — , consider n a continuous variable, differentiate with respect 
to it, and find the location of the minimum by setting the result equal to zero. The derivative 



is 



d x - = 1 + logx- lQg ^ 9 +log/i n -i +n-^-log/i n _i. (5.8) 
dn [n — \) z dn 

In order to evaluate the last term in (|5.8|) we use the identity 

** -3 1 /-n-3 r n-3 1 

E W = + °{n " 3)] + / 0(t)dt + (t-[t\- -)f'(t)dt, (5.9) 

fc =o 2 Jo Jo 2 

where 6(k) is given by Eq. ( 4.23b| ) and [t] stands for the integer part of t, which is valid for 

integer n. Then we define log /t„_i for noninteger n to be the right side of this expression 



evaluated at noninteger n. Substituting ( |5.9|) into (|5.8|), using the asymptotic expansion of 
the form ( |4.25 ) for the non-differentiated log h n -i, and replacing n — [n] by its average value 
of |, yields 

dl °f r ~^ ~ hhgn) 2 + (1 + log 2) hgn + \ogx + c 4 - ^ + O(^). (5.10) 
dn 2 2n n z 
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The constant C4 is denned by C4 = logC3 + 1 + log 2. Both this one and the asymptotic 
expressions that follow are valid for large values of n. (As is evident from Eq. ( 5.11j) , and 



is to be expected, for the minimum of — this is equivalent to x — > 0.) For brevity we omit 
writing down explicitly n —>■ 00 after each asymptotic relation. 

Setting the right side of ( |5.10| ) equal to zero and neglecting terms of order gives 
easily 



' ~ Int{ exp [ - 1 - log 2 + ^(1 + log2) 2 - 2 log x - 2c 4 ] 

+ 1 }. (5.11) 

2 J {I + log 2) 2 - 2 log x - 2c 4 J 



Here Int stands for the integer closest to the real number enclosed in the braces and we have 
denoted the integer nearest to the zero of (|5.10| ) by n' . The right side of Eq. (|5.11| ) agrees 



very well with the numerical results obtained for — : of the 91 points included in Figure 1, 
the two functions differ by 1 at only one point. The exponential term alone of Eq. (|5.11| ) also 



agrees very well with the numerical results. We have included the correction, however, to 
ensure that the expression for the value of the minimum is correct through constant terms. 

The asymptotic expansion for — follows from the asymptotic expansion for log h n -\. The 
result is 

- ~ (xec 3 ) n A /^n n(los2+ 5 lo s") x exp fO(— )]. (5.12) 
x V 2n n 



It is now straightforward to subsitute the right side of Eq. ( |5.11|) into the right side of Eq. 



( |5.12j ) and obtain an explicit expression for the minimum of — . The result is a lengthy 
formula which we do not reproduce here. It provides, however, a good approximation to 
the numerical result. Figure 3 shows the difference between the base- 10 logarithm of this 
analytical formula and the numerical result, as a function of — log 10 x. 

Eq. ( |5.12 ) is valid for any value of n, not only at the minimum of — . Should we wish 



to use Eq. ( |5.10| ) to simplify ( |5.12| ) at the minimum we could, but care should be taken to 
include the fact that n' is the zero of the right side of ( |5.10| ) rounded to the nearest integer. 
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The difference between n' and the actual zero is 0(-), which in — gives corrections of order 
one. Here we then merely note that at the minimum 



In Theorem 4 we have given an upper bound on the norm of vector fields L n which 
are computed by requiring that M.z agrees order by order with a given convergent power 
series. The bound grows with order more rapidly than the exponential function. Thus 
we cannot use the results of Ref. f| to ascertain the existence of a finite domain in x for 
which Ai n z is convergent and analytic as n — > oo (analyticity follows from the analytic 
nesting of domains of successive Lie transformations - see Eq. (3.1) in ||). Instead, we have 
sought to optimize the difference between M. n z and P{z) as a function of n. While an exact 
analytical expression for the minimum of a bound on the difference proved elusive, at least 
without significantly weakening the bound or the results of Theorem 4, we have given an 
asymptotic expression which is valid when the minimum occurs after a large number of Lie 
transformations. Comparison of asymptotic and numerical results, however, showed close 
agreement between the two even for values of x for which it is optimal to use a relatively 
small number of Lie transformations. (For example, the results agree to better than 5% for 



It seems well worthwhile to explore now whether the procedure developed in the preceding 
sections can be adapted to Hamiltonian normal form calculations and used to strengthen the 
estimates of the type given in Ref. Q for the norm of generating polynomials. It would also 
be interesting to examine if the absence of convergence of M. n z as n — > oo is only apparent, 
due to estimates that were used to obtain a rigorous bound, or is the true property of Lie 
transformations computed from a power series. The first step in this direction may be the 
numerical computation of M. n through a large value of n for some representative P{z). 




(5.13) 



VI. SUMMARY 
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Note that the calculation of coefficients a^, , using Eq. ( 2.1 1| ) or an equivalent, requires only 



algebraic manipulations, as all derivatives act on powers of z. 
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APPENDIX: PROOF OF LEMMA 3.1 

We first prove the relation (|3.6| ) by induction. For s = 0, m(7,0) = I, c t k{F, 0) = 
and -8(7,0) = (pi. Therefore (|3l6|) reduces to inequality (|3.4j ). For s = 1 we have 



j=l OZ 3 \i\=l 



E E E aStffSirf* ■ ■ ■ *r 3 ~ l ■ ■ ■ * d d+r \ (Al) 

3=1 \r\=n \i\=l 



which yields 



\ E M F > < max ^E E E l4?ll/ifcfe 

V \t\=m{l,l) J V j=l \r\=n \i\=l 

( d 

< a„max I £ £ 17** |*j 

S=l|i|=Z 



= aj max ( ^ IfSl ) 

k \i\=l ) 

< aJ4>i 

= B{1,1), (A2) 



as needed. Next, assume that (|3.6| ) holds for a fixed s. Then 



j=l az i |£|= m (7, s ) 

EE E a tj c tk{F, s)tjz\ 1+ri ■ ■ ■ Zj +rj ~ 1 ■ ■ ■ z l ^ +rd , (A3) 

j=l \r\=n \t\=m(l,s) 
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and we have 



V \c tk (F,s + l)\) <max(X: E E I^IM^ s)|t 



max _ 

\t\=m(l,s+l) ' x i=l |r|=n \t\=m(l,s) 



< a n max 

A; 



E E MF,s)\t 3 ) 

j=l \t\=m(l,s) J 



= a n m(l, s) max f E l c tfc(-^ ? s ) 

|t|=m(Z,s) 

< a n m(l, s)B(l, s) 

= B(l,s + l). (A4) 

The last inequality uses the induction assumption, whereas the last equality follows from 
the recursion relation satisfied by B. This completes the proof of relation ( |3.6| ). 

It is now straightforward to establish ( |3.7| ). We proceed again by induction. For s = 
we use inequality ( |3.4| ) and the special values of m, c, and B given at the beginning of the 
Appendix to see that ( |3.7| ) holds. For the case s — 1, on the other hand, we use relations 
( |AT| ) and ( |A2[ ) (second inequality) to get 



I r ir( l )f v \\\ — II VnWfWj v h+ri Ji+ r i- 1 id+r d \ 

\ Lj nf k \Z) || — || 2^ 2^ 2^ a rj Jik l i Z l " ' Z j ' " Z d I 



j=l \r\=n \i\=l 



<max EE El^ll^Ni ^ 1 

V j=l | r ]=n |i|=Z y 

< 5(/,l)a; m(w) . (A5) 

The first inequality is evident from the definition of the norm and for the last relation we 
have relied on the fact that m(l, 1) = n + I — 1. Assume now that (|3.7|) holds for a fixed s. 
With the help of relations (|A3|) and (|X4]) (second inequality) we get 

\\L s n +1 Fi l \z)\\ = HE E E o^c tk {F,s)t^...^- 1 ...^ 

j=l \r\=n \t\=m(l,s) 

< T (EE E \a^\MF,s)\ tj )x m ^ + ^ 

^ j=l \ r \=n \t\=m(l,s) 

< B{l,s+ l)x m(l ' s+1) . (A6) 

We have again made use of the recursion relation for m, m(l, s + 1) = m(l, s) + n — 1. This 
completes the proof of inequality (|3.7|). 
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Figure 1. n n 



vs. 



Figure Captions 

\og 1Q x. The step size in log 10 a; is 0.1. 



Figure 2. The value of log 10 R*/x at n = n 
as in Figure 1. 

Figure 3. The quantity A(x) = \og 1Q R*/x 



vs. 



log 10 x. The step size in log 10 a; is 0.1, 



vs. 



log 10 x. Here r*Jx 



-log 10 r*/x 

denotes the right side of Eq. Q5.12 ) and n' a the right side of Eq. ( ^.11| ). The step size in 
log 10 x is 0.1, as in Figures 1 and 2. 
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